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Abstract. Let H be a fixed graph on v vertices. For an n-vertex graph G 
with n divisible by v, an H- factor of G is a collection oin/v copies of H whose 
vertex sets partition V(G). 

In this paper we consider the threshold th[i(n) of the property that an 
Erdos-Renyi random graph (on n points) contains an ff-factor. Our results 
determine thu(n) for all strictly balanced H. 

The method here extends with no difficulty to hypergraphs. As a corollary, 
we obtain the threshold for a perfect matching in random fc-uniform hyper- 
graph, solving the well-known "Shamir's problem." 



1. Introduction 

Let H be a fixed graph on v vertices. For an n-vertex graph G with n divisible by 
v, an H- factor of G is a collection of n/v copies of H whose vertex sets partition 
V(G). (For the purposes of this introduction, a copy of H in G is a (not necessarily 
induced) subgraph of G isomorphic to H; but we will later find it convenient to 
deal with labeled copies.) We assume throughout this paper that v divides n. 

Let G(n,p) be the Erdos-Renyi random graph with edge density p. Recall that a 
function f(n) is said to be a threshold for an increasing graph property Q if 

Pr(G(n,p) satisfies Q) — ► 1 if p = p(n) = w(f(n)), and , . 

Pr(G(n,p) satisfies Q) — > if p = p(n) = o(/(n)). 

Of course if fin) is a threshold, then so is cf(n) for any positive constant c; nonethe- 
less, following common practice, we will sometimes say "the threshold for Q." 

Here we are interested in the (increasing) property that G{n,p) has an iJ-factor. 
We denote by thnin) a threshold for this property (where, to be precise, we really 
mean (1) holds for n = (mod v)). Determination of the thresholds thn is a 
central problem in the theory of random graphs, particular cases of which have 
been considered e.g. in [2, 13, 16, 19]. 

In this paper we come close to a complete solution to this problem, determining 
thff(n) for strictly balanced graphs (defined below), and getting to within a factor 
Tjot 1 ) f optimal in general. Our results generalize without difficulty to hypergraphs. 
Here the simplest case — namely, where H consists of a single hyperedge — settles 
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the much-studied "Shamir's problem" on the threshold for perfect matchings in 
random hypcrgraphs of a fixed edge size. (We will say a little more about this 
below, but for simplicity have chosen to give detailed arguments only for strongly 
balanced graphs and just comment at the end of the paper on how these arguments 
extend.) 

To get some feeling for the magnitude of thu (n), let us first consider lower bounds. 
If G contains an f/-factor, then each vertex of G is covered by some copy of H. So, 
if we denote by th^(n) the threshold for the property that in G(n,p) every vertex 
is covered by a copy of H, then 

th l j}(n) <th H {n). (2) 

For strictly balanced graphs (2) will turn out to give the correct value for thu- But 
for graphs in which some regions are denser than (or at least as dense as) the whole, 
we can run into the problem that, though the copies of H cover V(G), there is at 
least one vertex x of H for which only a few vertices of G play the role of x (in the 
obvious sense) in these copies. Formally we have 

(th% ] (n) <) th§(n) < th H (n), (3) 
where thjj (n) is the threshold for the property: 

• every vertex of G is covered by at least one copy of H, and 

• for each x € V(H), there arc at least n/v vertices x' of G for which some 
isomorphism of H into G takes x to x' . 

(We could replace n/v by Q(n) without affecting this threshold.) 

[21 

We believe th H tells the whole story: 
Conjecture 1.1. For every H, 

th% ] (n)=th H (n). (4) 



Though Conjecture 1.1 may already be a little optimistic, it does not seem impos- 
sible that even a "stopping time" version is true, viz.: if we start with n isolated 
vertices and add random (uniform) edges e\, . . . , then with probability 1 — o(l) we 
have an ff-factor as soon as we have the property described above in connection 
with th H . See [10] and [15, 4] for the analogous statements for matchings and 
Hamiltonian cycles respectively. 

We next want to say something about the behavior of thJ-jj and th)§ . For the issues 
we are considering the following notion of density is natural. Throughout the paper 
we use v(G) and e(G) for the numbers of vertices and edges of a graph G. 

Definition 1.2. For a graph H on at least two vertices, 
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and 



d*{H) = max d(H'). 



Definition 1.3. A graph H is strictly balanced if for any proper subgraph H' of 
H with at least two vertices, d(H') < d{H). 



(When the weaker condition d*(H) = d(H) holds H is said to be balanced.) 

Strictly balanced graphs are the primary concern of the present paper and are of 
considerable importance for the theory of random graphs in general; see for example 
[3, 12]. Basic graphs such as cliques and cycles are strictly balanced, as is a typical 
random graph. 

The thresholds th^j are known; see [12, Theorem 3.22]. In particular, for any H 
one has th^}(n) = Vl{n~ 1 / d ( H " l (\ogn) 1 / m ), where m is the number of edges in H. To 
see why this is natural, notice that the expected number of copies of H covering a 
fixed x € V(G(n,p)) is <d(n v ~ 1 p m ). We may then guess that (i) if this expectation 
is much less than log n then the number is zero with probability much more than 
1/n, and (ii) in this case it is likely that there are x's for which the number is zero. 

[21 

To specify th l H , we need a little notation. For v £ V(H) let 

d*(v, H) = max{d(H') :H'CH,ve V(H')} 

(the "local density" at v). Then clearly d*(v,H) < d*(H) for all v, with equality 
for some v. Let s v = mm{e(H') : H' C H,v e V(H'),d(H') = d*{v,H)} and let s 
be the maximum of the s„'s. (Similar notions enter into the determination of iftjy ; 
again see [12], noting that our uses of s v and s are not the same as theirs.) A proof 
of the following assertion will appear separately. 

Lemma 1.4. Ifd*(v,H) = d* W e V(H) then 

th% ] (n) =n- x l d ' \og 1/s n. 

Otherwise 

th l H ] (n)^n-^ d '. 



In particular we have 

th l H ] (n) = th l x ] (n) = n-WWQogn) 1 '™ (5) 
for a strictly balanced H with with m edges (see [22, 19]), and 

th ] §{n)>n- 1 ' d '^ (6) 

for a general H. 

Conjecture 1.1 says that the values for th l H ' given in Lemma 1.4 are also the values 
for thu- Some cases of the conjecture were known earlier: when H consists of a 
single edge, it is just the classic result of Erdos and Renyi [10] giving logn/n as 
the threshold for a perfect matching in G(n,p); and in [19, 2] (see also [12, Section 
4.2]) it is proved whenever d*{H) > 5(H) (the minimum degree of H). 
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The most studied case of the problem is that when H is a triangle, for which 
the conjectured value is 0(n _2 / 3 (logn) 1 / 3 ). This problem was apparently first 
suggested by Rucihski [20]; see [12, Section 4.3] for further discussion. Partial 
results of the form i/i#(n) < (3(n~ 2 / 3+£ ) were obtained by Krivelevich [16] (e = 
1/15) and Kim [13] (e = 1/18). 



2. New results 

The main purpose of this paper is establishing Conjecture 1.1 for strictly balanced 
graphs: 

Theorem 2.1. Let H be a strictly balanced graph with m edges. Then 
th H (n) = Q{n-^ d ^ H \logn)^ m ). 

For general graphs, we have 
Theorem 2.2. For an arbitrary H, 

th H {n)=0{n- 1 l d '^ +o ^). 

This is Conjecture 3.1 of [2]. Note that, in view of (6), the bound is sharp up to 
the o(l) term. 

We will actually prove more than Theorems 2.1 and 2.2, giving (lower) bounds on 
the number of ff-factors. Here, for simplicity, we confine the discussion to strictly 
balanced H . From this point through the statement of Theorem 2.4 we fix a strictly 
balanced graph H on v vertices and m edges (thus d(H) — m/(v — 1)). 

We use T{G) = T H {G) for the set of iJ-factors in G and $(G) = * ff (G) = \T(G)\. 
For the complete graph K n this number is 

{n/v)\\\Aut(H)\J y > 

where Aut(H) is the automorphism group of H. (Note: in general the expressions 
0(n), o(n) can be positive or negative, so the minus sign is not really necessary 
here.) Thus, recalling E denotes expectation, we have 

E*(G(n,p)) = n ^ n e-°^p mn ' v = e- oin \n v - 1 p m ) n ' v . (8) 
Strengthening Theorem 2.1, we will show that, for a suitably large constant C and 

p> Cn- 1/d(H \\ogn) 1/n \ 
< f>(G(n,p)) is close to its expectation: 

Theorem 2.3. For any C\ there is a C2 such that for any p > C^n -1 /^^ (log n) 1 / 7 ™ , 

$(G(n,p)) - e-° (n \n v - 1 p m ) n/v 
with probability at least 1 — n~ Cl . 
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Again note 0(n) can be positive or negative. The upper bound follows from (8) 
via Markov's inequality. Our task is to prove the lower bound, for which it will be 
convenient to work with the following alternate version. 

We say that an event holds with very high probability (w.v.h.p.) if the probability 
that it fails is n - ^ 1 ). We will prove the following equivalent form of Theorem 2.3. 
(The equivalence, though presumably well-known, is proved in an appendix at the 
end of the paper.) 

Theorem 2.4. For p = w(n _1 / d ( H )(logn) 1 / m ), the number of H-factors in G(n,p) 
is, with very high probability, at least e~°^ (n v ~ 1 p m ) n / v . 

As suggested above, the analogous extension of Theorem 2.2 also holds. 

Finally, we should say something about hypcrgraphs. Recall that a k-uniform 
hypergraph on vertex set V is simply a collection of fc-subsets, called edges, of V. 
Write Hk(n,p) for the random /c-uniform hypergraph on vertex set [n]; that is, each 
fc-set is an edge with probability p, independent of other choices. 

The preceding results extend essentially verbatim to fc-uniform hypergraphs with 
a fixed k. In particular, as the simplest case (when H is one hyperedge), we have a 
resolution of the question, first studied by Schmidt and Shamir [21] and sometimes 
referred to as "Shamir's problem": for fixed k and n ranging over multiples of 
k, what is the threshold for H.k(n,p) to contain a perfect matching (meaning, of 
course, a collection of edges partitioning the vertex set)? The problem seems to 
have first appeared in [9], where Erdos says that he heard it from E. Shamir; this 
perhaps explains the name. 

Shamir's problem has been one of the most studied questions in probabilistic com- 
binatorics over the last 25 years. The natural conjecture, perhaps first explicitly 
proposed in [7], is that the threshold is n~ fe+1 logn, reflecting the idea that, as 
for graphs, isolated vertices are the primary obstruction to existence of a perfect 
matching. Progress in the direction of this conjecture and related results may be 
found in, for example, [21, 11, 13, 7, 17]. Again, some discussion of the problem 
may be found in [12, Section 4.3]. 

We just state the threshold results for hypergraphs, the relevant definitions and 
notation extending without modification to this context. 

Theorem 2.5. For a strictly balanced k-uniform hypergraph H with m edges, 

th H {n) = etn-^^logn) 1 /™). 

Corollary 2.6. The threshold for perfect matching in a k-uniform random hyper- 
graph is 

e(n- k+1 \ogn). 
Theorem 2.7. For an arbitrary k-uniform hypergraph H, 

th H (n)=0(n- 1 / d *W+°W). 
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Again, the counting versions of these statements also hold (and are what's actually 
obtained by following the proof of Theorem 2.3). 

In the next section we give an overview of the proof of Theorem 2.4 and (at the 
end) an outline of the rest of the paper. 

Notation and conventions. We use asymptotic notation under the assump- 
tion that n — > oo, and assume throughout that n is large enough to support our 
assertions. We will often pretend that large numbers are integers, preferring this 
common abuse to cluttering the paper with irrelevant "floor" and "ceiling" symbols. 

For a graph G, V(G) and E(G) denote the vertex and edge sets of G, respectively, 
and, as earlier, we use v(G) and e(G) for the cardinalities of these sets. As earlier, 
for our fixed H we always take v(H) — v and e(H) — m. 

Throughout the paper V is [n] := {1, . . . , n} and, as usual, K n is the complete 
graph on this vertex set. We also use Kw for the complete graph on a general 
vertex set W. 

We use log for natural logarithm, lg for the indicator of event £, and Pr and E for 
probability and expectation. 

As noted earlier, we will henceforth use copy of H in G to mean an injection 
(p : V(H) -» V(G) that takes edges to edges (i.e. tp(E(H)) C E{G) with the natural 
interpretation of ip(E(H))). This avoids irrelevant issues involving |Aut(if)| and 
will later make our lives somewhat easier in other ways as well. Note that since this 
change multiplies $(G) by \Aut(H)\ n / v — e° (n \ it does not affect the statements 
of Theorems 2.3 and 2.4. Note also that we will still sometimes use (e.g.) K for a 
copy of H, in which case we think of a labeled copy in the usual sense. 

We use H{G) for the set of copies of H in G, H(x, G) = {K g H{G) : x g V(K)}, 
(where x € V(G)), and D(x, G) = \H(x,G)\. 



3. Outline 

In this section we sketch the proof of Theorem 2.4, or, more precisely, give the 
proof modulo various assertions (and a few definitions) that will be covered in 
later sections. For the moment we actually work with the model G(n, M) — that 
is the graph chosen uniformly from M-edge graphs on V — though in proving the 
assertions made here it will turn out to be more convenient to return to G(n,p). 
The relevant connection between the two models is given by Lemma 4.1, which in 
particular implies that Theorem 2.4 is equivalent to 

Theorem 3.1. For p = p(n) = w(n- 1 / d W(logn) 1 / ro ) and M = M(n) = (^)p, 



Pr($(G(n,M) > (n"- 1 p m )" /, 'e- 0(n) ) > 
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For the proof of this, let M = M(n) be as in the statement and T = (™) — M . 
Let ei,... , e^n\ be a random (uniform) ordering of E{K n ) and set Gi = if„ — 

{ei, . . . , e,} (so Go = if n ) and Fj = F(Gj) (recall this is the set of 7i-factors in 
Gi). Let & be the fraction of members of Fj_i containing e% (where we think of an 
iJ-factor as a subgraph of G in the obvious way). Then (for any t) 



\T t \ = l^o|{^| ■ ■ ■ yj^L = |^ | ( i - a) • • • (1 - 6), 



or 



Now 



logj^ t | =log!^ |+^log(l-6). (9) 



i=l 



(13) 



71 ' 17 — 1 

l0g| ^ 01 = l0g K#gi = ^T- lo g" ~ 0(n). (10) 

We also have 

(2) + 1 

since in fact 

Efe|ei,...,e i _i]=7i (12) 
for any choice of e\, . . . , ej_i. Thus 

t t /n\ 

E x - van , ( 9 ) 
E & = £^ = V lo g7^7 + o(1) ' 

i=l i=l ' V2^ c 

provided (2) — i > w(n). Let At (= At{n)) be the event 

t 

{log|F t | >log|F |-]T7-O(n)}. 

Our basic goal is to show that failure of At is unlikely. Precisely, we want to show 

fort<T, Pr(A t ) = . (14) 

This implies Theorem 3.1, since if At holds then (10) and (13) (the latter with 
t = T) give 

V — 1 771 72 

\og^(Gr) — log I .Ft I > nlogn H logp — 0(n). 

v v 



The proof of (14) uses the method of martingales with bounded differences (Azuma's 
inequality). Here it is natural to consider the martingale 

t 

*t = J>-7i) 
»=i 

(it is a martingale by (12)), with associated difference sequence 



8 



ANDERS JOHANSSON, JEFF KAHN, AND VAN VU 



Establishing concentration of Xt depends on maintaining some control over the 
| Zi|'s, for which purpose we will keep track of two sequences of auxiliary events, B% 
and IZi (1 < i < T — 1). Informally, Bi says that no copy of H is in much more 
than its proper share of the members of Ti, while TZi includes regularity properties 
of Gi, together with some limits on the numbers of copies of fragments of H in Gj. 
The actual definitions of Bi and TZi are given in Section 8. 

For i < T it will follow easily from Bi-\ and 7\U_i (see Lemma 8.2) that 

Ci = o(log- 1 n). (15) 

(The actual bound on £j implied by Bi-\ and TZi-\ is better when i is not close to 
T, but for our purposes the difference is unimportant.) 

The bound (15), while (more than) sufficient for the concentration we need, can 
occasionally fail, since the auxiliary properties Bi-i and IZi-i may fail. To handle 
this problem we slightly modify the preceding X's and Z's by setting 

_ f & — 7i if Bj and TZj hold for all j <i , 
At ~ \ otherwise (W) 

(and X t = £- = i Zi). 

As shown in Section 7, a martingale analysis along the lines of Azuma's Inequality 
then gives (for example) 

Prfl-Xtl >n) <n" w W. (17) 

Notice that if we do have Bi and TZi for i < t < T (so that X t — X^i=ite — 7»)) 
and |X t | < O(n) (it will actually be smaller) then we have At, since: 

t t 

i=l i=l 

whence (noting we have (15) for z < t) 

2^<oOog _1 n)E^ <0 ( n )' 

i=l i=l 

so that (using (9)) 

t t 

log |^| > log |^o| - + C 2 ) > log |^o| - - O(n). 

i=l i=l 



Thus the first failure (if any) of an At (with t < T) must occur either because X t 
is too large or because one of the properties B, 1Z fails even earlier (that is, Bi or 
TZi fails for some i < t). Formally we may write 



i<t i<t i<t 
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Here the second sum is, in view of the preceding discussion and (17), at most n 
and we will show, for i <T, 

Pr(£i) < n~ u W (18) 

and 

Pr(AK,B,)<u- u(1) . (19) 
Combining these three bounds gives (14) and Theorem 3.1. □ 

The rest of the paper is organized as follows. The next few sections (4-6) fill in 
some general results, and, as already mentioned, Section 7 gives the calculation 
needed for (17). In Section 8 we finally define the properties Bi and IZi, and 
slightly restate our main inequalities, (18) and (19). Proofs of these — the heart 
of our argument — are given in Sections 9-11. Extensions to general graphs and 
hypergraphs are discussed in Section 12. Finally, Section 13 gives the equivalence 
of Theorems 2.3 and 2.4. 

4. Models of random graphs 

As noted earlier, we go back and forth between the models G(n,p) and G(n, M). 

We recall the definitions, which were already used above: G(n,p) is the random 
graph on vertex set [n] gotten by taking each pair of vertices to be an edge with 
probability p, independently of other choices; and G(n, M) is the random graph on 
[n] gotten by choosing M edges uniformly from E(K n ). 

The following observation allows us to switch between the two models. (Note that 
is the expected number of edges in G(n,p).) 

Lemma 4.1. Let n n W = M < (™) be an integer, and p = M/("). 

(a) If an event £ holds with probability at least 1 — e in G(n,p), then it holds with 
probability at least 1 — 0(ne) in G(n, M). 

(b) If an event £ holds with probability at least 1 — e in G(n, M'), for all M/2 < 
M' < 2M, then it holds with probability at least 1 — e -n-^ 1 ) inG(n,p). 

Notice that the error terms in the conclusions are not the same as those in the 
hypotheses. For our purposes the extra factors will not matter, since we will always 
be dealing with failure probabilities of order n~ u ^ . 

Proof of Lemma 4.1. The number of edges in G(n,p) has the binomial distribution 
Bin(( ? 2 l ),p). Claim (a) follows from the fact that this number is equal to M with 
probability f2(l/n), and claim (b) from the fact that it is between M/2 and 2M 
with probability 1 — n^^^K 

Both these facts are simple properties of the distribution Bin(("),p) forp as in the 
lemma. 
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5. Concentration 

Let / = f(ti, . . . ,t n ) be a polynomial of degree d with real coefficients. We say 
that / is normal if its coefficients are positive numbers and the maximum coefficient 
is 1. We will be interested in the situation where the Ws are independent Bernoulli 
random variables (which will be i.i.d. in our applications, but need not be for the 
results stated here). Then / is itself a r.v., and we will want to say that, under 
appropriate conditions, it is likely to be close to its mean. 

All polynomials needed for the proof of Theorem 2.4 are multilinear, that is, of 
the form f{t) = ^ J a u t u , where U ranges over subsets of [n] and t v := Y\.ueu^ u - 
So for simplicity we confine the present discussion to such polynomials, though the 
results hold in greater generality. 

For a (multilinear) polynomial / as above and L C [n], the partial derivative 
(of order \L\) with respect to the variables indexed by L is X)toL a u^u\Li an d 
its expectation, denoted or E^/, is ^2{cvu Y[ieu\L P* • ^ — wnere U ~ 
Ber(pi). In particular, when p i — p\/i we have Yl,{ a uP [U ^ L ^ '■ U 2 L}. Set Ejf = 
max| L | =i E L f. 

It may be helpful to observe that when, as here, / is multilinear, we don't really 
need the polynomial language: We have a weight function a on subsets of [n] and 
are choosing a random subset, say T, of [n], with membership in T determined by 
independent coin flips. Then t is the indicator of T; f(t) is the (total) weight of 
the surviving sets (i.e. those contained in T); and E^f is the expected weight of 
the surviving sets containing L, given that L itself survives. 

We begin with a special case of the main result of [14]. 

Theorem 5.1. The following holds for any fixed positive integer d and positive 
constant e. Let f be a multilinear, homogeneous, normal polynomial of degree d 
such that E/ > n e maxi^x^ Ejf. Then 

Pr(\f-Ef\>eEf) = n-»W. 

Note that (here and below) d and e are fixed, and expressions such as w(l) are 
interpreted as n — > oo. 

Remark 5.2. Though we have stated the results in this section for normal polynomi- 
als, they clearly remain true if we instead assume some fixed bound on coefficients — 
in this case we say a polynomial is 0(l)-normal — and we will sometimes use them 
in this form. 

When / is normal, Edf = 1; so in order to use Theorem 5.1 we must have Ef = 
il(n £ ) (if one follows [14] more closely, then Ef = u((logn) 2d ) is sufficient). Thus, 
the theorem is not applicable when, for example, Ef is on the order of log n, a range 
that will turn out to be crucial for us. In such ranges the following theorem, which 
is essentially (the multilinear case of) Theorem 1.2 of [23], is useful. 
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Theorem 5.3. The following holds for any fixed positive integer d and positive 
constant e. Let f be a multilinear, normal, homogenous polynomial of degree d 
such that Ef = w(logn) and maxi<j<d-i Ejf < n" £ . Then 

Pr(|/-E/| > £ E/) = n-«. 

(Note: Theorem 1.2 of [23] asks that Ef be o(n), but this requirement is superflu- 
ous: since / is normal, Ef < n d ; so we can add, say, n d+1 dummy variables that 
don't appear in / to guarantee Ef = o(n), and then apply the theorem as stated. 
Also, to be very picky, the statement in [23] is in Unitary form — for any given (3 the 
probability is less than n _/3 if n/Q > Ef > Qlogn for a large enough Q = Qp — but 
this clearly implies the present version.) 

The key difference between Theorem 5.3 and Theorem 5.1 is that here we only 
need to consider partial derivatives of order up to d— 1. While this may seem to be 
a minor point, the proof of Theorem 5.3 is considerably more involved than that of 
Theorem 5.1. 

Our basic concentration statement is the following combination of Theorems 5.1 
and 5.3. 

Theorem 5.4. The following holds for any fixed positive integer d and positive 
constant e. Let f be a multilinear, homogeneous, normal polynomial of degree d 
such that Ef = w(logn) and maxi^xd-i Ejf < n~ € Ef. Then 

Pr(|/-E/| >dE/) = n-«. 

Corollary 5.5. The following holds for any fixed positive integer d and positive 
constant e. Let f be a multilinear, homogeneous, normal polynomial of degree d 
with Ef < A, where A (= A(n)) satisfies 

A > w(logn) + n e max Ejf, 

0<j<d 

then 

Pr(/> (l + e)A) <n-" (1) . 

Proof (sketch). If Ef > A/2 then this is immediate from Theorem 5.4. Otherwise 
we can augment / to some g > f for which we still have Eh < A and g satisfies 
the hypotheses of Theorem 5.4. (For instance, we can do this with g = g(t, s) = 
f(t) + h(s), where h(s) = h(si, . . . , s n ) — 7^{ s t r : U C [n], \U\ = d} with the Sj's 
all Bcr(p) (and different from the ti's), and 7 chosen so that Eh = A/2.) □ 

We also need an inhomogeneous version of Theorem 5.4. Write E' L — E' L f for the 
expectation of the nonconstant part of the partial derivative of / with respect to 
L. Of course for / homogeneous of degree d and < \L\ < d as in Theorem 5.4, we 
have E'J = E L f. 

Theorem 5.6. The following holds for any fixed positive integer d and positive 
constant e. Let f be a multilinear, normal polynomial of degree at most d, with 
Ef = w(logn) and max L -£0 E' L f < n~ £ Ef. Then 

Pr(|/-E/|> £ E/)<n-«. 
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Proof (sketch). We apply Theorem 5.4 to each homogeneous part of /. Say the 
fcth homogeneous part is f k . If Ef k > E//2 then Theorem 5.4 gives Pr(|/fe — Ef k \ > 
(s/d)Ef) = n~ w< - 1 ) (since Ef k = 0(E/)). Otherwise we can, for example, introduce 
dummy variables for 1 < i < \Ef~\ and j S [k], each equal to 1 with probability 
1, and let g k = J^i I\j z ij an d h k = fk + 9k- Theorem 5.4 (applied to h k , for which 
we have Eh k = 6(E/)) then gives 

Pr(|/ fc - Efk\ > £ -Ef) = Vv{\h k -Eh k \> E -Ef) = n~ u W. 

(The number of variables used for h k is not exactly n, but cannot be big enough to 
make any difference.) Consequently, we have 

d 

Pr(|/ - Ef\ > sEf) < £Pr(|/ fc - Ef k \ > Uf) = n^l 

k=i a 

□ 

The corresponding generalization of Corollary 5.5 is 

Corollary 5.7. The following holds for any fixed positive integer d and positive 
constant e. Let f be a multilinear, normal polynomial of degree at most d with 
Ef < A, where A = A(n) satisfies 

A > lo (log n) + n £ max E' L f. 

Then 

Pr(/ > (I + e)A) = n-^K 

Finally, we will sometimes need to know something when Ef is smaller. Here we 
cannot expect that / is (w.v.h.p.) close to its mean, but will be able to get by with 
the following weaker statement, which is (in slightly different language) Corollary 
4.9 of [24]. Note that in this case our hypothesis includes E®f (= Ef). 

Theorem 5.8. The following holds for any fixed positive integer d and positive 
constant e. Let f be a multilinear, normal polynomial of degree at most d with 
max £ E' L f < n~ £ . Then for any (3{n) = w(l), 

Pr(/ > (3(n)) =n- w W. 

For a random graph G(n,p), we use t e for the Bernoulli variable representing the 
appearance of the edge e; thus t e is 1 with probability p and with probability 
1 - p. Recall that for S C E{K n ), t s := Yle^s 

To get a feel for where this is headed, let us briefly discuss a typical use of the 
preceding theorems. Consider a strictly balanced graph H with v vertices and 
m edges, and suppose we are interested in the number of copies of H in G(n,p) 
containing a fixed vertex xq. This number is naturally expressed as a multilinear, 
normal, homogeneous polynomial of degree m in the t e 's: f — J2u^u, where U 
runs over edge sets of copies of H in K n containing x . Clearly Ef = Q(n v ~ 1 p m ). 

Now consider a non-empty subset L of E(K n ). The partial derivative El/ is 
J2udl^u\l- Let H' be the subgraph of K n consisting of L and those vertices 
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incident with edges of L, and let v' and ml (= \L\) be the numbers of vertices and 
edges of if'. Then E L f is 0(n v - v ' p m ~ m ') if L covers xo and 0(n v - v '- 1 p m - m ') if 
it does not. In either case 

E//E L / - fifr"'" 1 ?" 1 '). 

Since if is strictly balanced, (V — l)/m' > (v — l)/m. This implies that if E/ = 
n(l) (that is, i£p= n(n- ( - v - 1 '>/ m ) then Ef/E L f > n^ 1 ), a hypothesis for most 
of the theorems of this section. (This discussion applies without modification to 
hypergraphs, since it makes no use of the fact that edge are of size two.) 

6. Entropy 

We use H{X) = H e (X) for the base e entropy of a discrete r.v. X; that is, 

2f(*) = 5>(*)tog-^ 

X 

where p(x) = Vi(X = x). (For entropy basics see e.g. [8] or [18].) 

Given a graph G and y e V{G), we use X(y,G) for the copy of if containing y 
in a uniformly chosen if -factor of G, and h(y,G) for H(X(y,G)). (We will not 
need to worry about G"s without if-factors.) The next lemma is a special case of 
a fundamental observation of Shearer [6] . 

Lemma 6.1. For any G, 

log$(G)<i J2 h ^°)- 

yeV(G) 

(To be precise, the statement we are using is more general than what's usually 
referred to as "Shearer's Lemma," but is what Shearer's proof actually gives; it 
may be stated as follows. Suppose Y — (Yj : i € I) is a random vector and S a 
collection of subsets of I (repeats allowed) such that each i € I belongs to at least 
t members of S. Then HiY) < t^ 1 J2 SeS H(Ys), where Ys is the random vector 
(Yi : i € S). To get Lemma 6.1 from this, let Y be the indicator of the random 
if-factor (so I is the set of copies of if in K n ) and S = (S v : v e V), where S v is 
the set of copies of if (in K n ) containing v.) 

For the next lemma, S is a finite set, w : S — > 3? + , and X is the random variable 
taking values in S according to 

Pr(X = x) = w(s)/w(S) 

(where for ACS, w(A) = J2 x eA w ( x ))- ^ or perspective recall that for any r.v. X 
taking values in S, one has H(X) < log \S\ (with equality iff X is uniform from S). 

Lemma 6.2. If H{X) > log IS] — 0(1), then there are a, b G range(w) with 

a<b< 0(a) (20) 
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such that for J = w [a, b] we have 

\J\ = Q(\S\) 

and 

w(J) > .7w(S). 

Proof. Let 

H(X)=log\S\-K (21) 

and define C by logC = 4(K + log 3). With w = w(S)/\S\, let a = w/C, b = Cw, 
L = w- 1 ([0,a)), U = w- 1 ((b,oo]), and J = S \ (L U U). We have 

± log3 + W) log 1x1 + W) log 1 J| + W) loslul (22) 

Then we have a few observations. First, \U\ < \S\/C implies that the r.h.s. of (22) 
is less than 



log 3 + log \S\-^ log C, 



which with (21) implies 



w(CT-) < K , + l ° r g3 w(S) = w(S)/4. (23) 
logo 

Of course this also implies \U\ < \S\/4:. 

Second, combining (23) with the trivial v/(L) < w(S)/C, we have (say) w(J) > 
.7w(S). But then (third) since the r.h.s. of (22) is at most 

log 3 + log \S\ + ^ log M < log 3 + log \S\ + .7 log 

w(b) \b\ \b\ 

we have 

| J| > exp[-(.7)- 1 (^ + Iog3)]|S| (= Sl(\S\)). 



7. Martingale 

Here we give the proof of (17). As in Section 3, let 

X t = Z 1 + --- + Z t , 
where we use the modified Z^s of (16). We first prove that 



Pi(X t > n) < n'"^. (24) 

Notice that Zi is a function of the random sequence ei, . . . , ej, with E(Zj|ei, . . . , ej_i) 
for any ei, . . . , e^_i and (relaxing (15) a little) < e := log -1 n. By Markov's 
inequality, we have, for any positive h, 
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Pv(X t >n)= Vi{e h{ - Zl+ - +Zt ^ > e hn ) < E(e h(Zl+ - +z ^)e- hn . (25) 

Next we bound £( e h (Zi+---+z t )y g incc z . _ £ . _ ^ E(&|ei, . . . , e;_i) = -fi and 
< £i < e, we have (using convexity of e x ), 

E(e hz *\e u . . . , e^O < e"^((l - J) + |e fe ). 

A simple Taylor series calculation shows that the right hand side is at most e h ^ eii , 
for any < h < 1. Thus, for such /i 

E(e hz '|ei,... .ei-O^e" 2 ^, 

and induction on t gives 



£{e Hz 1+ -+z t)) = E [E(e^+-"+ z *)| ei ,,... ..et-O] 

= E[e fc ^ + - +z *-)E(e fcz *|e 1 , ) ...,,e t _ 1 )] 

< E[e /l(Zl+ '" +Zt - l) e' l2e7t ] 

< e^EL,^. 

Inserting this in (25) we have 

Pr(A t >n)< e ft2£ E* =1 7i-^. 

Since J2i=i 7» = O(nlogn) and e = log -1 n, if we set h to be a sufficiently small 
positive constant, the right hand side is e~ n ^ = n - ^ 1 ), proving (24). (Of course 
we could do much better — for Pr(|X| > A) < rT^^, A > Q(y/n logn) is enough — 
but this makes no difference for our purposes.) That Pr(X n < —n) < n - ^ 1 ) is 
proved similarly, using —X t in place of X t . (Actually we only use the bound on 
Pr(A„>n).) 

8. The properties B and TZ 

In this section we define the properties Bi and Hi and observe that in dealing with 
(18) and (19) we can work with G(n,pi) rather than Gi, where 




We will actually define properties B and 1Z(p) (p € [0, 1]); the events Bi and IZi 
are then 

{d satisfies B} 
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and 

{d satisfies lZ(pi)}. 

We need some notation. For a finite set A and w : A — > 3? + (:= [0, oo)), set 

w(A) = |A|- 1 ^w(a), 

aeA 

maxw(A) = maxw(a), 

aeA 

and 

maxr w(A) = w(A)^ 1 maxw(A) 
(the maximum normalized value of w), and write medw(^4) for the median of w. 

For a graph G on vertex set V and Z E Q, let w G (Z) = $(G-Z). Thus, w G (Z) 
denotes the number of ii-factors in the graph induced by G on the vertex set V\Z. 
We also use w G {K) = w G (V(K)) for K e H{G). 

The property B for a graph G is 

B(G) = {maxr w G (H(G)) = 0(1)}. (26) 

Thus, B(G) asserts that for any copy K of H in G, the number of ii-factors 
containing K is not much more than the average. 

There are two parts to the definition of 1Z(p). The first refers to the following 
setup, in which (V = [n] and) expectations refer to G{n,p). 

Given A C V(H), E' C E(H)\E(H [A]), ^ an injection from A to V, and G C if n , 
let X(G) be the number of injections ip : V(H) — > V with 

95 = V on A (27) 

and 

xy G £' ¥>(*)¥>(!,) G (28) 

Write X(G(n,p)) in the obvious way as a polynomial in variables t e = l{ e eE(G(n, P ))}, 
eeE(K n ): 

X(G(n,p)) = h(t) = Y,tv(E>h ( 29 ) 

<p 

where t = (t e : e € E(K n )) and the sum is over injections <p : V(H) — ► satisfying 
(27). Then /i is multilinear, 0(l)-normal and homogeneous of degree d = |£"|, and 
we set 

E* = max{E L h : |i| < d}. (30) 
(Note this includes L = 0, corresponding to Eh.) 

For the second part of the definition, we use D(p) for the expected number of 
copies of H in G(n,p) using a given x € V; that is, 

D(p) = v(n - l) v - lP m = 6(n"-y). 
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Definition 8.1. We say G C K n satisfies lZ(p) if the following two properties hold. 

(a) For A, E' and tp (and associated notation) as above: if E* = n~ n ^ , then for 
any f3(n) = u>(l), X(G) < f3(n) for large enough n; if E* > n~ olyl \ then for any 
fixed e > and large enough n, X(G) < n e E*. 

(b) For each xeV, \D(x, G) - D(p)\ = o(D{p)) 

Recall D(x, G) is the number of copies of H (in G) containing x. Thus (b) says 
that for any x, the number of copies of H containing x is close to its expectation. 

We pause for the promised 

Lemma 8.2. Fori <T (T as in Section 3), andlZi-i (that is, B and1Z(pi-\) 
for Gi-i) imply (15). 

Proof. Write w for wcj_ r We first observe that B and 1Zi-\ imply that, for any 
KeH(Gi-!), 

w(*-)/*(Gi_i) = 0(1/D(pi_i)) (31) 
(the left side is the fraction of H- factors in Gj_i that use K), since 

<&(£;_!) = -w(W(Gi_i)) 

n 

= -fi(|W(G<_i)|maxw(W(G j _i))) 
n 

(using B in the second line and part (b) of IZi-i in the third). 

On the other hand, a simple application of part (a) of IZi-i shows that, for any 
e e E(Gi-i), the number of K e W(Gj_i) containing e is at most /3 for some 
[3 — (3(n) satisfying (3~ 1 D(p i -i) = w(logn). (Here we need the observation that 
for i <T, D(pi_i) = w(logn).) Combining this with (31) we have (15). □ 

We also need the "p- version" of At- 

t 

A(p) = {log\f(G)\ > log |^ |- ^ 7i -0(n)}, (32) 

i=l 

where t = \(1 — p)^)] ( an d 7» is as in (11)). 

According to Lemma 4.1, (18) and (19) — and thus, as noted at the end of Section 3, 
Theorem 3.1 — will follow from the next two lemmas. 

Lemma 8.3. For p > w^-^^pogre) 1 /™), 

Pr(G(n,p) satisfies K(p)) = 1 - 

(The assumption on p is only needed for (b) of Definition 8.1.) 
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Lemma 8.4. For p > ^(^/^(logn) 1 /" 1 ), 

Pr(G(n,p) satisfies A(p)K(p)B) = 

These are proved in Section 9 and Sections 10 and 11 respectively. 

9. Regularity 

Here we verify Lemma 8.3; that is, we show that w.v.h.p. G = G(n,p) satisfies (a) 
and (b) of Definition 8.1. 

For (a), since there are only possibilities for A,E',ip, it's enough to show 

that for any one of these the probability of violating (a) is n^^^K This is imme- 
diate from the results of Section 5: recalling that each (A, E' , ip) corresponds to a 
homogeneous, multilinear, 0(l)-normal polynomial h as in (29) and E* given by 
(30), we find that the first part of (a) is given by Theorem 5.8, and the second by 
Corollary 5.5 with A = ^n £ E* (and any e smaller than the present one). 

For (b), write D(x,G) in the natural way as a polynomial of degree m (recall 
m = \E(H)\) in the variables t e = l{ ee E(G)} ( e <= E(K n )); namely (with t = (t e : 
eeE(K n ))), 

D(x, G) = f(t) := J2ltK : K e H (x)}, 
where Ho(x) is the set of copies of H in K n containing x (and, as in Section 5, 

We have 

Ef = e(n v - 1 p m )=u(logn), (33) 

and intend to show concentration of / about its mean using Theorem 5.4; thus we 
need to say something about the expectations Ex,/. 

For L C E(K n ) with 1 < \L\ = I < m we have 

E L f= P m - l N(L), 

where N(L) is the number of K e H (x) with L C E(K). Let W = V(L) U {x}, 
with V(L) C V the set of vertices incident with edges of L, and v 1 = \W\. Then 
N(L) = Q(n v ~ v ) if the graph H' := (W, L) is (isomorphic to) a subgraph of H, 
and zero otherwise. 

Thus, in view of (33), we have 

E//Ex/ = n(n"'-V) = fX" [(t/ ~ 1)/Mt, ~ 1)/m1 ') 

= n( n ll/d(H')-l/d(H)]^ = n fi(l) 

(where we used strict balance of H for the final equality). 

Combining this with (33), we have the hypotheses of Theorem 5.4, so also its 
conclusion, which is (b). 
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10. Proof of Lemma 8.4 

We now assume p is as in Lemma 8.4 and slightly simplify our notation, using G, 
1Z and A for G(n,p), TZ(p) and A(p). Events and the notation "Pr" now refer to 
G; so for instance Pr(«4) is the probability that G satisfies A. 

We will get at B via an auxiliary event C. Write 7io for the collection of v-subsets 
of V, and, for Y C V with \Y\ < v, H (Y) for {Z £ H a : Z D Y}. We extend the 
weight function w = wq to such Y by setting 

W (Y) = J2MZ) : Z eH (Y)}. 

Thus w(Y) is the number of "partial £f-factors" of size - — 1 in G — Y. The property 
C for G is 

for any Y e ( maxw(W (Y)) < max^" 2 ^" 1 )^), 2med w(W (Y))} 

(34) 

(where, recall, $(G) is the number of 7J-factors in G). 

Lemma 8.4 follows from the next two lemmas. 
Lemma 10.1. Pt(ATZC) = n~ w W 
Lemma 10.2. Pr(KCB) = n"^ 1 ) 

The proof of Lemma 10.1, which is really the heart of the matter, is deferred to the 
next section. Here we deal with Lemma 10.2. We first need to say that C implies 
that maxr w(Ho) = 0(1), or, equivalently, that for some positive constant 5, 

\{K G Ho : w(K) > (5maxw(H )}| = n(|Wo|) (= fl(n v )). (35) 

This is an instance of a simple and general deterministic statement that we may 
formulate (more precisely than necessary) as follows. Let V be any set of size n 
and w : (^) — > For ICFof size at most v write Hq(X) for the collection of 
w-subsets of V containing X, and for simplicity set ip(X) = maxw(Ko(l)). Let B 
be a positive number. 

Lemma 10.3. Suppose that for each Y C V satisfying \Y\ = v — 1 and ip(Y) > B 
we have 

I u — 7; 

\{ZeH a (Y): W (Z)>-^(Y)}\>—. 
Then for any X C V with \X\ = v — i and ip(X) > 2 l ^B we have 

\{ZeH (X):w(Z)>^(X)}\> (!LrHy_J_ . (36) 

Proof. Write Ni for the r.h.s. of (36). We proceed by induction on i, with the case 
z = l given. Assume X is as in the statement and choose Z € Hq(X) with w(Z) 
maximum (i.e. w(Z) = ip{X)). Let y e Z \ X and Y = X U {y}. Then \Y\ = v - 
(i — 1) and ip(Y) — ijj(X) > 2 t ~ 1 B (> 2 l ~ 2 B); so by our induction hypothesis there 
are at least AVi sets Z' e H (Y) with w(Z') > 2-( J - 1 V(r) (= 2-( J - 1 ) ^(J)). For 
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each such Z> ', Z' \ {y} is a (v - l)-subset of V with ip{Z' \ {y}) > w(Z') > B. So 
(again, for each such Z') there are at least (n — v)/2 sets Z" e Ho(Z' \ {y}) with 

w(Z") > W \ {y})/2 > 2"V(I). 

The number of these pairs (Z',Z") is thus at least A^_i(n — u)/2. On the other 
hand, each Z' associated with a given Z" is Z"\{u}U{y} for some u e Z"\(XL){y}); 
so the number of such Z' is at most i — 1 and the lemma follows. □ 

Proof of Lemma 10.2. Set 7 = [2 v+1 (v - l)!]" 1 and 6 = 2~ v . Now C implies the 
hypothesis of Lemma 10.3 with B = (2n)~^ v ~ 1 ^(G) (actually with any B greater 
than n -2 ^ -1 )), and we have (trivially) 

V>(0) > rT^-V^G) = 2 V - 1 B. 

Thus (36) applies, yielding (35), now with constants specified: 

\{K e Ho : w{K) > (5maxw(W )}| > yn v . 

Let J be the largest power of 2 not exceeding maxw and 

Z = {ZeH : w(Z) > 5 J}. 

For X C V with \X\ < v let Z(X) = {Z E Z : X C Z}, and saylCV with 
|X| < w is goorf if |Z(X)| > 7 n t '-l x l. Thus in particular, 

the empty set is good (37) 

(as are all singletons, but we don't need this). 

Fix an ordering a\, . . . ,a v ofV(H). For distinct x\, . . . ,x r EV, write S(x\, . . . , x r ) 
for the collection of copies ip of H in K n for which 

ip(ai) = Xi for i e [r], 

ip(a,i)<p(a,j) € E(G) whenever i,j >r and ajdj e E(H), 

and p(V(ff)) E Z. 

For r e {0, , . . . , v} let iV r = N(a r ) n {a r+ i, . . . , a„} and d r = |iV r |. (In particular 
d v = 0.) Let 3^1, • • ■ , x r ) be the event 

{\s( Xl ,... ,x r )\=n( P d <-+-+ d —n v - r )}. 

In particular 

S*(0) = {p e H(G) : w(^(V(fl"))) > 5 J} 

and 3^(0) is the event 

{\sm\ = n( P m nn}. 

For (distinct) x\, . . . , x r e V let <2(xi, . . . , x r ) be the event 
{{xi, ... ,x r } is good} A y(a;i, . . . ,x r ). 

Since BC C Q(0) (by (37)), Lemma 10.2 will follow if we can show 

Pr(ftQ(0)) = n~ w(1) . 
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For inductive purposes we will actually prove the more general statement that for 
any r and x\, . . . ,x r , 

Pr(TZQ(xi, ... , x r )) = n-^l (38) 

Proof of (38). We proceed by induction ono-r. The case r = v being trivial (since 
for w-subscts of V being good is the same as belonging to Z), we consider r < v, 
letting X = {xi, . . . , x r }. 

Let V be the event 

{y € V\X,XU{y} good =► y(x u ... ,x r ,y)}. 
By inductive hypothesis it's enough to show 

Pr(nVQ{ Xl ,... ,x r )) <n~ u ^ (39) 
(since Vr{TZQ) < Pt(TZVQ) + Pr(TZV) and, by induction, Pi(KV) = n - u ^). 

Note that if X is good then 

|{y:XU{y}good}|=n(n). (40) 

We need to slightly relax 1Z to arrange that the edges between x r and V \ X 
arc independent of our conditioning. Say G satisfies IZx if it satisfies (a) in the 
definition of 1Z (= TZ(p)) whenever A — {a\,... ,a r }, ip{ai) — Xi (i <E [r]) and 
E'CE(H-A). 

If VSP A {X good} holds but y{x\, . . . , x r ) does not, then we have the following 
situation. There is some J = 2 k with k an integer not exceeding nlogn (see (10)) 
so that (with Z, "good" and other quantities as in the preceding discussion, but 
now defined in terms of this J) 

(i) Tlx holds; 

(ii) there are at least Q(n) y's in V \ X for which we have y(x\, ... ,x r ,y) 
(by (40)); but 

(iii) y(xi, . . . ,x r ) does not hold. 

Note that, for a given J, properties (i) and (ii) depend only on G' := G — X. Since 
the number of possibilities for J is at most nlogn, it is thus enough to show that 
for any J and G' satisfying (i) and (ii) (with respect to J), 

Pr(y(a5i,..., a;r )|G , )=n" h ' (1) - ( 41 ) 

Given such a G' (actually, any G'), \S{x\,... ,x r )\ is naturally expressed as a 
multilinear polynomial in the variables 

tu ■— ~L{x r ueE{G)} UEV\X] 

namely 

\S(x u ... ,x r )\ =g(t) :^^a u t u , 

u 
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where U ranges over d r -subsets of V \ X, and a v is the number of copies ip of 
K := H — {ai, . . . , a r } in G' with 

i>(N r ) = u 

and 

^{{a r+u ... ,a v })UX eZ. (42) 
In order to apply Theorem 5.4 we normalize and consider 

f(t)=a- 1 g(t), 

with a the maximum of the a u , s. We then need the hypotheses of Theorem 5.4; 
these will follow (eventually) from IZx ■ 

We may rewrite 

9(t)= ^2{ t v(N r )-V^S(x 1 ,...,x r ,y)}. 
y ev\x 

Noting again that the variables t u (u e V \ X) are independent of G' (which 
determines the sets S(x\, . . . ,x r ,y)), and using property (ii), we have 

E.9=/" J2 \S(x u --- ,x r ,y)\^n(p d ^-+ d ^n v - r )- (43) 
yev\x 

Note that if d r = then there is nothing random at this stage and we have 

1^!,...,^)!= ]T \S(x u ... lXr ,y)\ = Q(p d ^-+ d ^n v - r ), 
yev\x 

which is what we want; so we may assume from now on that d r > 0. 

If we set H' = H—{ai, . . . ,a r _i} thcnc? r + - • -+d v -i = e(H') and u-r = v(H') — l, 
so that the r.h.s. of (43) is Q(p et - H ''>n v(H '^ 1 ). Thus, since H is strictly balanced, 
we have 

,_ f w(logn) if r = 1 . . 

E - 9= { >) ifr>l. (44) 

We are going to prove that 

E/ = w(logn) (45) 
(in most cases it will be n^' 1 ') and 

max{E T / : T C V \ X, < \T\ < d r } = Ef. (46) 

If we have these then Theorem 5.4 says that w.v.h.p. / and (therefore) g are close 
to their expectations, which in view of (43) is what we want. 

For the proofs of (45) and (46), it will be more convenient to work with the partial 
derivatives of g than with those of /. As in Section 5 we use t e = l{ ee E(G)}, 
ts = Ilees *e and 

t = (t e : e e E{K V \ X )) (47) 
(where, recall, K v \ x is the complete graph on V \ X). 
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Let T QV\X with < I := \T\ < d r . (Note we now include I = d r , in which case 
Erg is just ar-) 

Since we are only interested in upper bounds on the partial derivatives of g, we 
may now disregard the requirement (42); thus we use 

p-(dr-l) ETg < h[t) :=Y^t v (E(K )h (48) 

where the sum is over injections 

if : V(K) —>V\X with <p(N r ) D T (49) 
(and, again, with the obvious meaning for ip{E{K))). 

Set E* = max{E L /i : L C E(K V \ X ),\L\ < \E(K)\}. We assert that there is a 
positive constant e (depending only on H) so that (for large enough n), 

p d --'E* < n- e Eg. (50) 

(Of course we have already chosen G', but we now need to reexamine the random- 
ization that produced it to see what TZx says about the E^'s.) 

Before proving (50), we show that it gives (45) and (46). For (45) we need 

a~ 1 Eg = w(logn). 

We apply(50) with T = U, a d r -subset of V \ X (in which noted above, a\j 

is just Eug). We consider two possibilities. If Eg > n e / 2 then TZx gives (note here 

d r - I = 0) 

au < n £ / 4 max{l, E*} < n'^^Eg. 

Otherwise we have E* < n~ 6 / 2 . In this case, recalling that Eg = o>(logn) (see 
(44)), we can choose (3(n) = ui(l) with [3(n)~ 1 Eg = w(logn), and TZx guarantees 
that au < (3(n). So in cither case we have a~ 1 Eg = w(logn). Since none of these 
bounds depended on the choice of U, this gives (45). 

For (46) we need Et5 = n~ n ^Eg for any T as in (46). This follows from TZx if we 
assume, as we may, that the constant e in (50) is less than l/d(H), since we then 
have, e.g., 

E T g < p d r- l n e ' 2 max{l, E*} < n^^Eg. 



For the proof of (50), fix L C E(K V \ X ) and let k = \E(K)\ - \L\. We have E L h = 
p k NL, where Nl is the number of ip as in (49) with tp(E(K)) D L. In particular 
each such tp satisfies ip(V(K)) D W := T U V(L), where, as earlier, V{L) <ZV\X 
is the set of vertices incident with edges of L. Letting W — {w\, ... , w s }, we have 
Nl = Y^, Nl(P\, ... , b s ), where (&i, . . . , b s ) ranges over s-tuples of distinct elements 
of V(K) and the summand is the number of ip's as above with tp(bi) = Wi for i e [s]. 
Since there are only O(l) choices for the bi's, we will have (50) if we show (for any 
choice of frj's) 

p dr - l+k N L (h,... ,b s ) = n~ n ^Eg. (51) 
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Let (given bi's) H" = H[{a r , h, . . . , b s }]. Then 

JV L (6i, ... ,b s )< n"- T '- s = n v-r-{v(H")-i) ^ 

while k > I + d r+ i + • • • + — e{H") (since |-E : (-ft')| = d r+ i + • • • + d„_i and 
E(H") contains <^ _1 (L) and at least I edges joining a r to V(K)). Thus 

P ^- l+k N L (h,... ,b s ) <p «*r+-+<i-i n »-r[ n »(ff")-y(ff")]-l. 



Since iJ is strictly balanced (and since v(H") — 1 > I > 0), the expression in square 
brackets is n n ^\ so that, in view of (43), we have (51). 

This completes the proof of Lemma 10.2. 



11. Proof of Lemma 10.1 



We continue to use G, 1Z and A as in the preceding section. 



Assume that we have A and 1Z and that C fails at Y (see (34)). Let R = Y U {x} <G 
H (Y) satisfy w(R) = maxw(H {Y)). Note that 

w(i?) > n- 2 ("- 1 )$(G). (52) 



Choose y G y \ F with w(Y U {y}) < medw(H (^)), and with h(y,G - R) 
maximum subject to this restriction (see Section 6 for h), and set S = Y U {y}. 
Thus in particular 

w(R) > 2w(S). 



Note that the lower bound in (32) is now 

v — 1 , mn , . . 
nlogn H logp — U(n) 

v v 

(see (10) and (13)), while D — D{p) (as in Definition 8.1) satisfies 
log!? = (v — 1) logn + mlogp — 0(1). 

Thus A and (52) give 

v — 1 J7J11 

log$((G - R)> n log n H \ogp - 0(n), 

v v 

while from 1Z we have, using Lemma 6.1, 

log<I>(G -R)< ^[log((l + o(l))D) + h(y, G - R)}. 

(In more detail: Lemma 6.1 gives log$(G — R) < J2zev\R ^( z ' G— R); and we 
have h(z, G-R) < h(y, G - R) for at least half the z's in V \ R, and h(z, G-R)< 
logD(z, G — i?) < log((l + o(l))D) for every z, using 1Z and the fact that for any 
random variable k, H(k) < log |range(«)|.) 
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Combining (and rearranging) we have (again using 1Z) 

h{y,G-R) > («-l)logn + mIogp-0(l) > log D G -«(y) - 0(1). 

(53) 

Let W = V \ (Y U {a;, y}) and for Z E (^J set 

w'(2) = $(G-(fUZ)). 

The corresponding weight functions w y on W(y, G — R) and w x on H(x, G — S) are 

w„(A-) = w'(V(K) \ {y}) and w x (^) = W(V(K) \ {x}). 

Thus X(y, G—R) (see Section 6) is chosen according to the weights w y , and similarly 
for X(x, G - S). Note also that w y (H(y, G - R)) = w(R) and w x (H(x, G - S)) = 
w(S). 

According to Lemma 6.2, (53) implies that there are a,b E range(w y ) (= range(w')) 
as in (20) for which J := w^ 1 ([a, b]) satisfies 

\J\>Sl(\H(y,G-R)\) (54) 

and 

w„(J) > .7w y (H{y, G-R)) = .7w(R). 
Setting J' = w~ 1 ([a, b]), we thus have 

w„(J) > .7w(R), (55) 

while 

w x (J') < w(S) < .5w(i?). (56) 

Once we condition on the value of G[W], w 2/ ( J) and w x ( J') are naturally expressed 
as evaluations of a multilinear polynomial in variables : u € W^}, as follows. 
Given U C W with |t/| < v - 1, let G* v be the graph obtained from G[W] by 
adjoining a vertex w* with neighborhood U. Let /Cj/ be the set of copies of H in 
Gy containing {w*u :u£ [/}, and 

a. = £{w'(F(JQ \ {«;*}) : if e ACy, w'(F(tf) \ {w*}) E [a, 6]}. (57) 

The desired polynomial is then 

g(t) = a utu, 

ucw 

and w y ( J) and w x (J') are g evaluated at t' := l{ z ew.yze E(G)} an d f := ^-{zew-.xzeE(G)}- 

Now 7?. implies \H{y, G — R)\ = Q(n v ~ 1 p m ) = w(logn). (In more detail: 7Z implies 
that Dc(y) = Q(n v ~ 1 p m ) and, as is easily seen, that the number of copies of H in 
G containing y and meeting R is o(n v ~ 1 p m ).) Thus by (54) we also have 

I j| = e(n"-y"), 

and (since 6 < 0(a)) 

w„(j) - e^-y")- (58) 
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The key idea is now that we can use Corollary 5.7 to deduce that, unless something 
unlikely has occurred, Eg must be of a similar size. 

Fix T C W, say with \T\ = I < v. For d = I, . . . , v - 1, and t as in (47) with V\X 
replaced by W, we consider the polynomial 

h S) = t<p(E(H-z)) , (59) 

z if 

where z ranges over vertices of H of degree d and ip over injections V(H) \ {z} — > W 
with if(N z ) D T. Then 

ax < b ■ hi(t) 

("<" because of the restriction w'(V(K) \ {w*}) e [a, b] in (57)) and 
Let 

EJj = maxjEL/id : L C E(iifw), |i| < m - rf}. 

We assert that there is a positive constant e (depending only on H) so that (for 
each d), 

p d ~ l E* d < n- e n v - x V m - (60) 

The proof of this is essentially identical to that of (50) and we omit it. (Actually 
(60) is contained in (50): it's enough to prove (60) with replaced by E*(z) 
gotten by replacing hd in the definition of EjJ by the inner sum in (59); but this 
inner sum is bounded by the polynomial h(t) in (48) with r = 1, a\ = z and 
d r = d ("bounded by" rather than "equal to" because we've replaced V \ {x\} by 
the slightly smaller W); and, finally, note that we actually proved (50) with Eg 
replaced by n(p d -+-+ d -^n v ^ r ) (see (43).) 

Now intending to apply the results of Section 5, we consider / = oT x g, where 
a = m&x u a u . Then 

f(t') = a^WyiJ) = 9(c t - 1 k"-y) 
(see (58)), and we have 

f(t')=u(\ogn) (61) 

and 

max{E^/ : T C W,T ^ 0} = n~ n ^ f{t'). (62) 

These are derived from (60) in the same way as (45) and (46) were derived from 
(50), and we will not repeat the arguments. 

In summary, ARC implies that there are Y, x,y and (with notation as above) 
a, b e range(w') for which we have (61), (62), and, from (55) and (56), 

/(*") < W)- (63) 
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But for given Y, x, y, a and 6, / depends only on G[W]. On the other hand, given 
G[VF], t and t' are independent r.v.'s, each with law Bin(W,p) (where we say t — 
(t w : w £ W) has "law Bin(W,p)" if the t w 's are independent, mean p Bcrnoullis). 
Thus the following simple consequence of Theorem 5.6 and Corollary 5.7 applies. 



Claim. For any e > and d the following holds. If f is a multilinear, normal 
polynomial of degree at most d in n variables, ((n) = w(logn), and t' , t" are 
independent, each with law B'm([n],p), then 

Pr(/(0 > max{C(n),n £ maxE' f,{l + e)f{t")}) = n~ w ^ . 

Since there are only polynomially many possibilities for Y, x, y, a and b, this gives 
Lemma 10.1. 



Proof of Claim. Set 

A = - max{C(n), n e max E' T f }. 

If E/ < A then Corollary 5.7 gives 

Pr(/(t / ) > A) =n-^; 

otherwise, by Theorem 5.6, 

Pr(/(f ) > (1 + e).f{t")) < Pr(max{|/(i') - E/|, \f(t") E/|} > ( £ /3)E/) 

= n-^\ 



12. Extensions 



As mentioned earlier, we will not repeat the above arguments for general graphs 
or for hypergraphs; but we do want to stress here that "repeat" is the right word: 
extending the proof of Theorem 2.4 to produce (the counting versions of) Theorems 
2.2, 2.5 and 2.7 involves nothing but some minor formal changes. To elaborate 
slightly: 

While strict balance is used repeatedly in the proof of Theorem 2.4, a little thought 
shows that all these uses are of the same type: to allow us to say that, for some 
proper subgraph H' of H, and p in the range under consideration, 

n »(*')-y(ff') =n n « (64) 

as follows from 

e(H')/(vH') - 1) < m/(v- 1) (65) 

(this is what we get from strict balance) and the fact that p > n -1 ^^ (of course 
it is actually somewhat bigger). See the last few paragraphs of Section 5. 

For general graphs we no longer have (65) but can still guarantee (64) — though, 
of course, at the cost of some precision in the results — by taking p > n _1 / d *( ff ' +e 
for some fixed positive e. This leads to Theorem 2.2. 
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The extensions to hypergraphs are similarly unchallcnging, basically amounting 
to the observation that our arguments really make no use of the assumption that 
edges have size two. 

13. Appendix A: equivalence of Theorems 2.3 and 2.4 

Of course we only need to show Theorem 2.4 implies Theorem 2.3. 

Set i?(n) = n- 1 /d(ff)(io gn ) 1 /™ an d n(n,p) = (n v - 1 p m ) n / v , and write G for G(n,p) 
(for whatever p we specify) . 

Given K, we would like to show that there is a Ck for which p = p(n) > CK$(n) 
implies 

Pr($(G) < i^(n,p)e- CKn ) < rT K Vn. (66) 

Suppose this is not true and for each C, n define gc{n) by: if p = p(n) = <7c(ro)#(n) 
then 

Pr($(G(n,p)) < n(n,p)e- Cn ) = n~ K . 

If the sequence {gc( n )}n is bounded for some C then we have (66) with Ck the 
maximum of C and the bound on gc- 

So we may assume that {gc{ n )}n is unbounded for every C. We can then choose 
n\ < ri2 < • • • so that gc(nc) > C for C = 1, 2, 

Define the sequence {g(n)} by <ji(n) = gc{ n c), where C is minimum with n c > n. 
Since g(n) — > oo, Theorem 2.4 says that there is a C* so that if 

p = p{n) = g{n)d{n) (67) 

then 

Pr(*(G(n,p)) < i u(n,p)e" c * 11 ) = n^f 1 '. 
Now choose no so that for n > no and p as in (67) 

Pr(*(G(n,p)) < M («,p)e- C *") < 
and then C > C* with n c > n . Then for p still as in (67) and n = nc we have 

Pr($(G(n,p)) < Ai(n,p)e" c ") < Pr($(G(n,p)) < Ai(n,p)e~ c *") < n~ K . 

(68) 

But this contradicts our definition of g, according to which the l.h.s. of (68) is n~ K . 
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